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Abstract 

The spectral theory of graphs provides a bridge between classical signal processing and the nascent 
field of graph signal processing. In this paper, a spectral graph analogy to Heisenberg's celebrated uncer- 
tainty principle is developed. Just as the classical result provides a tradeoff between signal localization 
in time and frequency, this result provides a fundamental tradeoff between a signal's localization on a 
graph and in its spectral domain. Using the eigenvectors of the graph Laplacian as a surrogate Fourier 
basis, quantitative definitions of graph and spectral "spreads" are given, and a complete characterization 
of the feasibility region of these two quantities is developed. In particular, the lower boundary of the 
region, referred to as the uncertainty curve, is shown to be achieved by eigenvectors associated with the 
smallest eigenvalues of an affine family of matrices. The convexity of the uncertainty curve allows it to 
be found to within e by a fast approximation algorithm requiring ©(e"^/^) typically sparse eigenvalue 
evaluations. Closed-form expressions for the uncertainty curves for some special classes of graphs are 
derived, and an accurate analytical approximation for the expected uncertainty curve of Erdos-Renyi 
random graphs is developed. These theoretical results are validated by numerical experiments, which 
also reveal an intriguing connection between diffusion processes on graphs and the uncertainty bounds. 
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I. Introduction 

Heisenberg's uncertainty principle is a cornerstone of signal processing. The simple inequality |[T]|, ||2l, 

A?A^ > \, (1) 

in which Aj and measure the "time spread" and "frequency spread" of some signal, respectively, is 
one way to precisely characterize a general principle with far-reaching consequences: that a signal cannot 
be concentrated in both time and frequency. 

In this paper, we establish analogous uncertainty principles for signals defined on graphs. The study 
of signals on graphs, and the extension of classical signal processing techniques to such nonstandard 
domains, has received growing interest in the past few years (see, e.g., llBl- lfTTI ). These studies are 
often motivated (and enabled) by the deluge of modern data collected on various technological, social, 
biological, and informational networks ifTll . The efficient acquisition, representation, and analysis of such 
high-dimensional graph-based data present challenges that should be addressed by the development of 
new signal processing theories and tools. 

A. Related Work 

Uncertainty principles date back to Heisenberg |[T], who in 1927 proved a result that Weyl and Pauli 
soon afterward generalized to ([1}. It was also shown that the bound in ([T|) is achievable by Gaussian- 
shaped functions and frequency modulations thereof. A lifetime later, analogous results were found for 
discrete-time signals as well |[T3l . |[T4l . Similar uncertainty principles have also been established on the 
unit sphere S'^ ifTSl and, in more abstract settings, on compact Riemannian manifolds |[T6l . 

In a different line of work, Donoho and Stark lITTI introduced a new concept of uncertainty related 
to signal support size. They showed that a length discrete-time signal with support set T in the time 
domain and support set W in the frequency domain satisfies IT] |W| > N. This bound is a nonlocal 
uncertainty principle — it limits the cardinality of a signal's time and frequency support sets, even if each 
is the disjoint union of far-flung subsets. Further studied in, e.g, |[T8l - ll20l . these nonlocal uncertainty 
principles laid the foundation for sparse signal recovery from partial measurements. 

In the same vein of the classical (and local) uncertainty principle stated in ([T|l, we have been studying 
the following question: given an arbitrary graph, to what extent can a signal be simultaneously localized 
on that graph and in the "frequency" domain? To obtain the spectral representation of these signals, 
we use the standard approach of treating the eigenvectors of the graph Laplacian operator [211 as a 
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Fourier basis. The Laplacian encodes a notion of smoothness on a graph ll22l and is analogous to the 
Laplace-Beltrami operator on a manifold ll23l . 

The analogy between the spectral decomposition of graph Laplacians and the standard Fourier transform 
has been used to extend the concept of bandlimited sampling to signals defined on graphs O and in 
the construction of wavelet transforms on graphs E], |0, ifTOl . In the latter case, as pointed out in |[TOl . 
a desirable property of the wavelet transforms is that the dictionary elements (i.e., wavelets) should be 
well-localized in the graph and spectral domains. Our results provide a way to precisely quantify this 
desideratum, as well as its fundamental limit. 

B. Contributions and Paper Organization 

We begin in Section |II] with a review of some basic concepts in graph theory, including the definition 
of the graph Laplacian matrix and its spectral decomposition. After justifying the use of the Laplacian 
eigenvectors as a Fourier basis on graphs, we define in Section Hl-C I the graph spread about a vertex uq, 
Ag^^(a;), and the spectral spread, /\^g{x), of a signal x defined on a graph. These two quantities, which 
we first introduced in some preliminary work [|24l . lE5l . are defined in analogy to the standard time and 
frequency spreads, respectively. 

In |[24ll . we developed a lower bound on the product of and analogous to ([T|). However, the 

bound was not tight and applied only under restrictive conditions for the graph and the signal on it. In 
ll25l we took a new approach to characterize a more general and precise relationship between the two 
kinds of uncertainty. In this paper, we continue this line of investigation, and provide a rigorous basis 
for the arguments presented in ||25| . in addition to some new results. 

The main contributions of this paper are developed in Section Jill where we characterize the uncertainty 
bound, in Section [TVl where we analyze the bound when applied to special families of graphs, and in 
Section |Vl where we reveal a connection between diffusion processes and the uncertainty bound. The 
main results are summarized as follows: 

1. Convexity of the feasible region: We prove that, when the underlying graph is connected and contains 
at least three vertices, the feasibility region of all possible pairs (Ag(a;), A^ „^(a;)) is a bounded and 
convex set. The feasibility region's convexity was stated without proof in ||25l . 

2. Characterization of the uncertainty curve: We provide a complete characterization of the curve 

luois) = mm A2„^(a;) subject to IS.l{x) = s, 

which forms the lower boundary of the feasibility region. Studying 7u„(s), which we will refer to 
as the uncertainty curve, is important because it is a fundamental bound analogous to the classical 
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uncertainty bound ([T}. Theorem [T] states that each point on the uncertainty curve is achieved by an 
eigenvector associated with the smallest eigenvalue of a particular matrix-valued function M{a). Varying 
the parameter a allows one to "trace" and obtain the entire curve 7mo('5)- A rigorous and complete proof 
of Theorem [T] is provided in this paper, extending the rough argument given in ll25l . Based the convexity 
of (s), we show in Section ITlI-C I that the sandwich algorithm 1^61 can be used to efficiently produce 
a piecewise linear approximation for the uncertainty curve that differs from the true curve by at most e 
(under a suitable error metric) and requires solving 0{e~^^^) typically sparse eigenvalue problems. 

3. Special graph families: The uncertainty curves for several special families of graphs are investigated 
in Section |IVl For complete graphs and star graphs, we derive closed-form formulas for the uncertainty 
curves 7^0(5). For Erdos-Renyi random graphs ||27| . ll28l . we develop an analytical approximation for 
the expected value of 'yuois), which is shown through experiment to be very accurate. 

4. Diffusion process on a graph: In Section IVl we reveal an intriguing connection between the classical 
uncertainty principle for functions on the real line and our results for signals on graphs. In the classical 
case, the solution to the heat equation ^ = ^ starting at t = as an impulse is a Gaussian function 
with a variance that grows linearly with t; this solution achieves the Heisenberg uncertainty bound ([T|). 
We first show experimental results indicating that a diffusion process starting with an impulse on a graph 
follows the graph uncertainty curve very closely (though not, in general, exactly.) We then prove in 
Proposition |4] that the match is exact for the special cases of a star graph or a complete graph. We further 
prove in Proposition |5] that for general graphs, under a simple condition on the distance function on the 
graph, the first two derivatives of the uncertainty curve and the curve traced by the diffusion process 
match at the point corresponding to t = 0. We conclude the paper in Section |Vl] 

II. Mathematical Formulation 
A. Graphs, Signals, and Notation 

We define a simple, undirected graph as G = {V, E), where V = {vi,V2, . . . , vn} is a set of N vertices 
and E = {ei, 62, . . . , eA/} is the set of M edges. Each edge is an unordered pair of two different vertices 
u,v S V, and we will use the notation n ~ t> to indicate that u and v are connected by an edge. The 
fundamental structure of a graph G can be captured by its adjacency matrix A = [ojjjjj, where aij = 1 
if there is an edge between vi and Vj, and aij = otherwise. As defined, the diagonal of A is always 
zero because a simple graph may contain no loops {i.e., edges connecting one vertex to itself), and A is 
symmetric because the graph is undirected. (A common generalization is to consider a weighted graph. 
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where each edge (1 < < M) is associated with a positive "weight" Wm- However, in this paper 
we only consider unweighted graphs.) 

The degree of a vertex v, denoted by deg(f ), is the number of edges incident upon that vertex. We 
define D as the diagonal matrix that has the vertex degrees on its diagonal, i.e., 



To quantify the graph-domain spread of a signal, we will need a notion of distance, denoted by d{u, v), 
between any pair of vertices u and v on the graph. A simple choice is to use the geodesic distance ||2TI . 
in which case d{u, v) is the length of the shortest path connecting the two vertices. In this work, we 
only consider connected graphs, so d{u, v) is always finite. Other distance functions have been proposed 
in the literature, including the resistance distance ||29l and the diffusion distance IH. Our subsequent 
discussions are not confined to any particular choice of the distance function. The only requirement is 
that d{u,v) should form a semi-metric: namely, d{u,v) > with equality if and only if u = v, and 
d{u, v) = d{v, u). 

A finite-energy signal defined on the graph x G is a mapping from the set of vertices to M. It 

can be treated as a vector in M^, and so any such signal will be denoted by a boldface variable. There 
is a natural inner product on ^^(G) defined by {x^y) = y^x, which induces a norm ||a;|| = \/ x^x. We 
will denote the value of x at vertex v by x{v). An impulse at u G y, i.e., a signal that has value 1 at u 
and everywhere else, will be denoted as 5^. 

B. The Laplacian Matrix and Graph Fourier Transforms 

As mentioned in Section H the graph Laplacian matrix plays an important role in this work. There are 
several different definitions of the Laplacian matrix commonly used in the literature. The unnormalized 
Laplacian matrix 11211 is given by Lunnorm == D — A, where D and A are the degree matrix in dU and the 
adjacency matrix, respectively. In this paper, we find it more convenient to use the normalized Laplacian 
matrix ll30l . defined as 



D = diag {deg(ui), deg(u2), • • • , deg(wAr)} . 



(2) 



def 



D 



1/2 



'norm — 



unnorm 



= 1 - D- 



l/2^^-l/2_ 



The choice of unnormalized or normalized Laplacian makes no essential difference to our analysis in 
Section |llll The latter is chosen because it leads to simpler expressions in some of our derivations. For 
notational simplicity, we will drop the subscript in inom. calling it L in what follows. 
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Intuitively, the Laplacian matrix is analogous to the continuous Laplacian operator — or — ^ 
on the real line. In fact, when the underlying graph is a line or a cycle, L provides the standard 
stencil approximation for the second-order differentiation operator. The same holds for higher-dimensional 
lattices. In more general settings where the graphs are formed by sampling an underlying continuous 
manifold, the Laplacian matrix converges at high sampling densities to the Laplace-Beltrami operator, a 
differential geometric analogy to the second derivative ||23]| . 

By construction, L is a real symmetric matrix. We can therefore diagonalize L as 

L = FAF'^, (3) 

def 

where F is an orthogonal matrix whose columns are the eigenvectors of L, and A = diag {Ai, A2, . . . , Aat} 
is a diagonal matrix of eigenvalues, which are all real. L can be shown to be positive semidefinite with 
rank less than N, so we can order the eigenvalues as = Ai < A2 < . . . < Aat. 

A large number of the topological properties of a graph can be inferred from the spectrum of its 
graph Laplacian ll30l . For example, a graph is connected (meaning that a path can always be found 
connecting one vertex to the other) if and only if the smallest eigenvalue (Ai = 0) has multiplicity one. 
The corresponding unit-norm eigenvector is defined by 



V Z2uev deg(«) 

where deg{v) is the degree of the vertex v. One can also show that the maximum possible eigenvalue of 
L is equal to 2, attained only by bipartite graphs. (These are graphs with two mutually exclusive subsets 
of vertices Uq and Ui such that every edge connects a vertex in Uq to a vertex in Ui.) 

Given a signal x G i^{G), we can represent it in terms of the eigenvectors of L by computing 

X = F'^x, (5) 

where x is called the graph Fourier transform of x. The matrix F represents the Fourier transform 



operatoiQ- Since F is orthogonal, FF^ = I. It follows that we can invert the Fourier transform by taking 

X = Fx. 



'There may be eigenvalues of L with multiplicity greater than one, so we should really think of the Fourier transform as the 
set of projections onto the eigenspaces associated with each unique eigenvalue. The Fourier transform defined in this way is 
unique up to unitary transformations within eigenspaces. We can choose an orthogonal basis in each eigenspace, ensuring that 
F is orthogonal. 
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Fig. 1. (a) A cycle graph with 16 vertices. Signals defined on this graph are equivalent to standard discrete, periodic signals, 
(b) Several eigenvectors of the graph Laplacian. These eigenvectors exhibit the sinusoidal characteristics of the DFT basis. 



Using the Laplacian eigenvectors as a surrogate Fourier basis is a standard approach in the literature for 
defining signal processing operations on graphs |@]-||6], ||9l, lITOl . It may not seem immediately obvious, 
though, that the analogy is a fair one. In what follows, we provide some justification for this approach. 

First, consider the special case of a cycle graph, illustrated in Figure |l(a)[ Signals defined on this 
graph can be thought of as discrete, periodic signals. The Laplacian of this graph is a circulant matrix, 
and is thus diagonalized by a discrete Fourier transform (DFT) matrix. Thus, in this case the Laplacian 
eigenbasis is exactly the common sine/cosine DFT basis. Figure |l(b)| shows several such eigenvectors, 
which exhibit sinusoidal characteristics with increasing oscillation frequencies. 

For general graphs, of course, the Laplacian eigenbasis is no longer the DFT basis. Nonetheless, the 
eigenvectors still satisfy our intuition about frequency. For example, we would Uke to say that a signal 
is "highpass" if its value changes significantly between neighboring vertices, and that it is "lowpass" if 
its value varies very little. To quantify the variation of a signal on a graph, we can construct an x i\/ 
normalized incidence matrix S EH . where each column of S corresponds to an edge e = (n, v) and has 

exactly two nonzero values: H — . in the row corresponding to vertex u, and . in the row 

corresponding to vertex v. The choice of {u, v) or {v, u), and therefore the signs involved, is arbitrary for 
each edge (though it is important that each column have one positive and one negative value.) For any 
X G the vector y = S'^x is a signal on the edges of the graph, where each edge has the difference 

between the normalized valuej^ of x on its endpoint vertices. So, in a way, y is the "derivative" of x. 



^The normalization by will limit the undue effect on the Laplacian of a vertex with a large number of incident edges. 

V dcg{ ii) 



(a) A = 0.00 



(b) A = 0.32 



(c) A = 0.69 




(d) A = 1.31 (e) A = 1.68 (f) A = 2.00 

Fig. 2. Some Laplacian eigenvectors of a graph. Straight lines indicate that values on joined vertices have the same sign; 
wavy lines indicate that there is a sign change between the joined vertices. As is evident, eigenvectors associated with larger 
eigenvalues correspond to more sign changes and thus faster variation. 



For any nonzero signal x, we can then measure its normalized variation on the graph as 

rrl|2 



x{u) x{v) \ 1 II I 2 



\y\ 



^ x^SS^x 



|a;|p 



1 r 

X Lx, (6) 



\x\p 



where the last equality {SS^ = L) is well-known and easy to verify ||2TI . When the signal x is the 
ith eigenvector /j of L, the normalized variation in ^ becomes A^, the corresponding eigenvalue. 
This justifies the usage of Laplacian eigenvalues as frequencies: eigenvectors corresponding to the higher 
eigenvalues of L are the high-variation components, and the lower eigenvalues correspond to low-variation 
components. We illustrate this fact with an example in Figure |2] 



C. Graph and Spectral Spreads 

We would like to quantify the localization of a signal on a graph in both the graph and spectral domains. 
To do so, we look to the definitions of analogous quantities in classical time-frequency analysis. For a 
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nonzero signal x G £ (M), its time spread about a point is defined by ||2l 



^lto = l^l {t-tof\xit)\'dt. (7) 



\x 

The overall time spread of x{t) is then obtained by minimizing over Iq, i.e., 



A? = min „ / (t - tof\x{t)fdt, (8) 



to \\xP 



where the minimizing value of to is given by to = jj^p- t\x{t)\'^dt. Generalizing (|7) to signals defined 
on graphs, we introduce the following definition ll24l . ll25l . 

Definition 1 (Graph spread): For a nonzero signal x G l'^{G), its graph spread about a vertex is 



where •) is the distance metric described in Section Hl-AI and P^a is a diagonal matrix defined as 

Puo = diag{d(no,'yi),c?(uo,f2), • • .,d{uQ,VN)} ■ (10) 
Remark: Similar to dSjl, we can also define the overall {i.e., global) graph spread of x as 

For our subsequent analysis on uncertainty principles though, we will focus on the local graph spread 
{i.e., about a particular vertex uq) as defined in Unlike classical domains such as the real line whose 
topology is shift-invariant, the "landscape" of a graph can look very different around different vertices. 
Thus, it is important to explicitly specify the center vertex uq when considering the graph spread and 
uncertainty principles. If needed, global versions can always be obtained through finite minimization over 
all no G V. 

The spectral spread of a signal defined on graphs requires more thought. In the classical case, the 
frequency spread of a real- valued signal x{t) G is given by ||2l 



oo 



oj'\x{uj)\' — , (12) 
zvr 



where x{ijj) is the Fourier transform of x{t). This expression is simpler than that of the time spread in (|7]) 
because the frequency center is chosen to he ujq = Q due to the symmetry of the Fourier transforms of 
real-valued signals. On recognizing that uj'^x{uj) is the Fourier transform of ^^x{t) and using Parseval's 
identity, we can rewrite ([T2] | as 

= / x{t)-^x{t)dt. (13) 
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Generalizing to the graph case, treating L as analogous to the operator — we obtain the following 
definition ll24]l. lIBl. 

Definition 2 (Spectral spread): For a nonzero signal x G ^^(G), we define its spectral spread as 

^l[x)''^^x''Lx (14) 
I N 

~ Ti — TTo / , ^ri \^n\ ■, (15) 
" " n=l 

where the second equality follows from the decomposition of L in Q and the definition of graph Fourier 
transforms in 

Remark: The equivalent definitions in ([141 ) and ( fTSl ) reveal two different facets of the spectral spread: 
while ([TSl l perhaps more clearly justifies the "spectral" nature of A^(a;), the form in ([141 ) shows that 
A^(x) can also be understood as the normalized variation of x introduced in (O. 

III. Uncertainty Principles: Bounds and Characterizations 

Intuitively, we can reason that there should exist a tradeoff between the graph and spectral spreads of 
a signal. If the graph spread is small, then the signal must resemble an impulse centered at some 
vertex; in this case, the normalized variation {i.e., the spectral spread A^) should be high. If instead 
Ag is small, then the signal cannot vary too quickly; it will thus take a long distance for the signal 
values to drop significantly from the peak value, in which case the graph spread will be high. How can 
one precisely quantify the above intuition? What are the signals with a given spectral spread that are 
maximally localized on the graph? These are the fundamental questions addressed in this section. 

A. The Feasibility Region 

In the classical uncertainty principle, not all pairs of time-frequency spreads (Aj , A^) are achievable, 
and the tradeoff is quantified by the celebrated inequality A^A^ > i, which holds for any nonzero 
function x{t) G £^(M) |[ll, lUl. Furthermore, this bound is tight. In fact, any pair of the form (A^, A^) = 
(c, ^) for c > is achievable by a function of the form x{t) = exp (^—j^ 

In a similar way, we are interested in characterizing the following feasibility region 

V^/^{{s,g):Al{x) = s, ^l^S^) = g 

for some nonzero x G 1^{G)}, (16) 

containing all pairs of the form (A^,A^„^) that are achievable on a graph G, using uq as the center 
vertex. 
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Proposition 1: Let be the feasibility region for a connected graph G with vertices. Then the 
following properties hold: 

(a) is a closed subset of [0, Aat] x [0,<S^(mo)], where A^r < 2 is the largest eigenvalue of graph 

def 

Laplacian L, and 8g{'^q) = maxt,gy d(no, f ) is the eccentricity of the center vertex uq. 

(b) intersects the horizontal axis at exactly one point, (1,0), and the vertical axis at exactly one 
point, (0, /f -P^o/i), where fi is the eigenvector defined in dJ]). 

(c) The points {1,8q{uq)) and (Atv, /^P^^/jy), where /jy is any unit-norm eigenvector associated 
with Atv, belong to 

(d) is a convex set if the number of vertices > 3. 

Proof: (a) The graph and spectral spreads of any nonzero signal can be bounded by the largest and 
smallest eigenvalues of L and P^^^. More precisely, using the Rayleigh inequalities lISTI . we have 

x^Lx 

= Ai < < Xn 

X^ X 

and, similarly, 

is compact, and therefore closed, because it is the image of a compact set under a continuous 
transform ll32l . Specifically, if we take the unit sphere in M^, a compact set, and apply the map f : x ^ 
(A^(a;), „^(a;)), which is continuous on the unit sphere, we get the whole uncertainty region. 

(b) A signal has zero graph spread {i.e., A^^^ (a;) = 0) if and only if it is an impulse supported on uq, 
i.e., x{v) = ciiv = uq and x{v) = otherwise, for some nonzero scalar c. Meanwhile, using (fT4b and Q, 
one can verify that the normalized variation (and thus the spectral spread A^) of such impulse signals is 
equal to 1. It follows that (1,0) is the only point that lies at the intersection of and the horizontal axis. 
Next, consider the intersection of Dug with the vertical axis. Since Ag(a;) = a;^La;/||a;p > Ai = 0, the 
spectral spread A^(a;) = if and only if x is an eigenvector of L associated with the smallest eigenvalue 
Ai = 0. (See dUl for an example.) Such eigenvectors are also unique (up to scalar multiplications) since 
the smallest eigenvalue Ai of connected graphs always has multiplicity one ll30l . 

(c) The inclusion of (Aat, /^P^^/^) in Vug is clear. For the first point {1,£q{uo)), consider an 
impulse function supported at the furthest vertex on the graph from uq. Similar to (b), we can compute 
its spectral and graph spreads as A^ = 1 and A^ „^ = £q{uq), respectively. 

(d) See Appendix |Al ■ 
Remark: Figure |3] illustrates a typical feasibility region Vu„ as specified by Proposition [2 The bound- 

edness and convexity of imply that the entire region can be completely characterized by its upper 
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Fig. 3. The feasibility region Dug for the spectral and graph spreads. is a bounded and convex set that intersects the 
horizontal (and vertical) axis at exactly one point. The lower boundary of Dug can be implicitly computed by considering 
supporting lines of varying slopes. The achievable region must lie in the half-plane above the supporting line (found by solving 
an eigenvalue problem.) 

and lower boundaries: any pair between the two boundaries must also be achievable. Furthermore, the 
lower boundary must be convex and the upper boundary must be concave. 

B. The Uncertainty Curve 

In what follows, we will describe a technique for computing the lower boundary curve of Vu^, which 
we call the uncertainty curve. 

Definition 3: Given a connected graph G, the uncertainty curve with respect to a center vertex uq is 

7no(s) =*^nnn Ag „^,(a;) subject to A^(a3) = s 

= mill X Pua^ subject to a; x = 1 and x Lx = s, (17) 

for all s G [0, Aat]. 

Remark: We could also define and study the upper boundary curve of V^g in a similar way. We choose 
to focus on the lower boundary curve because it provides an uncertainty bound analogous to the classical 
bound ([T|). We will say that a signal x achieves the uncertainty curve if A"^ ^^^{x) = 7u„(As(a;)^). 

We note that dTTl ) is a quadratically constrained quadratic program ||33l . The equality constraints make 
the problem nonconvex. On differentiating the corresponding Lagrangian function 

A{x; a, A) =* x^Pf^^x — a{x^ Lx — s) — \{x^x — 1), 
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we see that the optimal solution x* to (fTTl ) must satisfy 

{Pl^ - aL)x* = \x* 

for some a, A G M. If we treat a as being fixed, then the above equality becomes an eigenvalue problem. 
This observation leads us to study the matrix-valued function 

M{a)=Pl^^-aL. (18) 

For any a, the smallest eigenvalue of M{a), denoted by 

9(a) = A,nin(M(a)), 

and its associated eigenspace, denoted by SS{a), are key to our analysis of the uncertainty curve 7^0 (s)- 

Proposition 2: For any a G M and any unit-norm eigenvector v in SS{a), the point {v^Lv, v'^Pf^^v) 
is on 7„o(s). 

Proof: Let x be an arbitrary signal with ||a;|| = 1. By definition, ugi^) ~ aA^(a;) = x'^ M{a)x. 
Applying Rayleigh's inequality to M{a) thus leads to 

Al^^ix)-aAlix)>qia) (19) 
= v^pIv - av'^Lv, (20) 

where (l20l ) comes from the fact that v is an eigenvector associated with q{a). Let s = v'^Lv. On 
specializing the relationship (l20l) to those signals x satisfying A^(a;) = s, we have 

Af, „ (x) > v^pI V, 

g,Uo \ J — Mo ' 

which indicates that the point {v'^ Lv , v"^ Pf^^v) must lie on the uncertainty curve ^uo{s)- ■ 
There is an interesting geometric interpretation of the above derivations: as illustrated in Figure |3] for 
any a, the inequality in ( fT9l ) defines a half-plane in which must lie. The boundary of the half-plane, 
a line of slope a defined by 

provides a tight lower bound to V^g- Varying the values of a generates a family of such half -planes, the 
intersection of which contains P^o- For readers familiar with convex analysis, we note that q{a) is the 
Legendre transform of juo{s) lf33l . 

Proposition |2] guarantees that any nonzero eigenvector of M{a) associated with the smallest eigenvalue 
q{a) generates a point on the curve 7^^,(5). Next, we will show that the converse is also true: every point 
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on 7mo(s) is achieved by an eigenvector in SS{a) for some a. To establish this result, we need to 
introduce the following two functions: 

h+(a) = max x^Lx and /i_(a) =* min x^Lx, (21) 

a;eS5(a): ||a;|| = l a;eS5(a): ||a;||=l 

which measure, respectively, the maximum and minimum spectral spread (i.e., the horizontal coordinate 
on the s-g plane) that can be achieved by eigenvectors in SS{a). 
Lemma 1: The following properties hold for /i+(a) and /i_(a). 

(a) They are increasing functions, i.e., h^{ai) < /i+(a2) and h-{ai) < /i_(a2) for all ai < 02- 

(b) They have the same limits as |a| tends to infinity: 

lim = lim = 0, (22) 

a—^—oo a—^—oo 

and 

lim h^(a) = lim h-{a) = Xn- (23) 

a— >+oo a— >+oo 

(c) On any finite interval [a,b], the functions h^{a) and h^{a) differ on at most a finite number of 

def 

points, denoted hy B = {Pi, 132, ■ ■ ■ , /3fc} for some > 0. Except for these points, /i+(q) and h-{a) 
coincide, are continuous, and satisfy 

h+{a) = h^{a) = -q'{a), for all a G [a, b] \ B, (24) 

where q'{a) is the derivative of q{a). At the points, if any, where they do differ, /i+(q) and h-{a) 
have jump discontinuities. Moreover, for all /3 G 

h^{/3) = lim /i+(a) > lim h-[a) = h-{/3), 

where the limits are taken as a approaches /3 from the positive and negative sides, respectively. 
Proof: See Appendix iBl ■ 

The results of Lemma [U are illustrated in Figure BJa), where we plot a typical example of /i+(a) and 
/i_(a): as a increases from —00 to +00, the values of the functions increase from to Aa?. Within any 
finite interval, /i+(a) = except at a finite number of points {e.g., the point /? in the figure). At 

these "jump points", h+{a) is right-continuous, whereas /i_(a) is left-continuous. 

Since we are only considering connected graphs, Ai = has multiplicity 1, and so fi is the unique 
vector (up to scaling) that achieves the uncertainty curve with = 0. At the other end, A^r may have 
multiplicity, but some vector in its eigenspace will achieve the uncertainty curve with = X^. For 
values of s G (0, Amax)» we can use the following theorem to precisely characterize vectors that achieve 
the uncertainty curve at s. 
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Fig. 4. The mapping of the eigenvectors in S{a) onto the s-g plane is shown. In (a), h+{a) and h-{a) are plotted against 
a (they coincide except at jumps in the plot.) They are, respectively, the maximum and minimum spectral spreads of elements 
of the eigenspace S{a). Any element of S{a) determines a point on the graph of 7tio(s). When S{a) is of dimension greater 
than one, it corresponds to a line segment on 'juais). 



Theorem 1: A signal x G with A'^{x) G (0, Amax) achieves the uncertainty curve, i.e., Ag^^^{x) = 

7(A^(ic)), if and only if it is a nonzero eigenvector in SS{a) for some a. 

Proof: The "if" direction has been established in Proposition |2] To prove the "only if" direction, 
we will show that for any signal x G that achieves the uncertainty curve, there is an a and a 

unit-norm eigenvector v G SS{a) such that v'-^Lv = A^(a;). Since both x and v lie on the uncertainty 
curve (with the former given as an assumption and the latter guaranteed by Proposition |2l), we have 

x'^M{a)x = A^^^^ix) - aAl{x) = v^M{a)v = q{a). 

Now, since q{a) is the smallest eigenvalue of M{a), the equality above implies that x must also be 
an eigenvector associated with q{a). In fact, x will be equal to v (up to a scalar multiple) if q{a) has 
multiplicity one. The remainder of the proof verifies the claim, namely, for any s G (0, A at) we can find 
an a and a unit-norm eigenvector v G SS{a) such that v'^Lv = s. 

By part (b) of Lemma [T] we can always find some a < b such that h^{a) < s < h^{b). Furthermore, 
part (c) of Lemma [T] ensures that, within the interval [a,b], the two functions /i+(a) and h^{a) differ 
(and are discontinuous) on at most a finite number of points. For notational simplicity, and without loss 
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of generality, we assume that there is only one such discontinuity point, denoted by /? G [a, 6]. As shown 
in Figure m the interval [h-{a), h+{b)] can now be written as the union of three subintervals 

[h.{a),h.{f3)), [/i_(/3),/i+(/3)], and(/i+(/3),/i+(6)], 

to one of which s must belong. 

We first consider the case where s G [h^ (a), /i_ (/3)). Lemma[T]says that /i„ (a) is a continuous function 
on [a,P]. By the intermediate value theorem, there exists some oq G [a, /3] such that /i_(ao) = s. By 
definition, /i_(ao) = ™iiiz65S{cio); ||2||=i Lz. Since the eigenspace 55(00) has finite dimensions, the 
minimization can always be achieved by some unit-norm eigenvector v G 55(qo), i.e., s = h-{ao) = 
v^Lv. The same line of reasoning can be used when s belongs to the third subinterval, (/i+(/3), h^{b)]. 
This leaves us with the remaining case when s G /i+ (/?)]. Let 

def T r 1 def . T r 

t>+ = argmax z Lz and v = argmm z Lz, 

zeSSil3):\\z\\ = l zeSS(j3):\\z\\ = l 

and consider the vector-valued function y{6) = ^°l^^l'l'^(^2e)^^^^]'" ^ ' defined for 9 G [0, 7r/2]. The denomi- 
nator is nonzero for every 6, since / [otherwise we would have h-.{j3) = /i+(/3)]. So y{9) is of 
unit norm and is a continuous function of 9. It also must belong to SS{I3) since it is a linear combination 
of two elements of the subspace. Furthermore, y{0)'^ Ly{0) = h^{/3) and y(7r/2)-^Ly(7r/2) = h^{j3). 
By the intermediate value theorem, y{9) for 9 G [0, 7r/2] achieves all the values in between. In particular, 
there exists some 9q such that y{9Q)'^ Ly{9Q) = s. We note that since every element of SS{(3) achieves 
a point on the line g — /3s = <?(/?), this interpolation procedure amounts to including the straight line 
segment between the two endpoints as part of the uncertainty curve. ■ 
Remark: If SS{a) is one-dimensional for every a G [a, b], or more generally if there is a single distinct 
eigenvalue function that achieves the minimum on [a,b], then from Theorem [T] as well as Lemma [T] and its 
proof, q{a) is analytic on [a, b] and the corresponding portion of the uncertainty curve can be expressed 
in parametric form as 

s{a) = —q'{a) 

(25) 

luAs) = q{a) - aq'{a), 

where the first equality is due to (|24] | and the second is due to the fact that any vector in S{a) must 
achieve a point on the line g — as = q{a). 

In general. Theorem [T] and its proof justify a way to obtain the uncertainty curve: for every a, we 
find the eigenvectors associated with the smallest eigenvalue of M{a). These eigenvectors will give us 
points on 7^0(5). By "sweeping" the values of a from —00 to 00, the entire curve can then be traced. 
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C. Fast Approximation Algorithm 

In practice, of course, we must sample and work with a finite set of a's, wliicli lead to an approximation 
of the true curve. In what follows, we describe an efficient algorithm that can compute an approximation — 
more specifically, an upper and lower bound — of (s) with any desired accuracy. 

Since 7^0(5) is the lower boundary of the convex region Vug, it is itself a convex function. We can 
therefore use the sandwich algorithm described in || 26l to approximate it. The algorithm can be easily 
understood by studying Figure 5(a)[ consider a segment of the curve 7^^ (s) with two end points A and 
B, whose coordinates are denoted by (0,7^^(0)) and (6,7^0 (6)), respectively. Also given are supporting 
linej^ containing the end points, represented by the line segments AD and BD. Due to the convexity of 
7uq(s), the chord that connects Ato B must lie entirely above the curve and thus form an upper bound. 
Similarly, the combination of AD and BD forms a piecewise linear lower bound of 7^0 (s). 

To refine these two initial bounds, let a be the slope of the chord, i.e., 

a = . (26) 

b — a 

Computing the smallest eigenvalue q{a) and the associated eigenvectors of M{a), we can obtain a new 
point on the curve, denoted by C in Figure [5(a)| The s-g coordinates of C are {v'^Lv, v'^P'^^^v), where 
V is a unit-norm element in the eigenspace SS{a). Our previous analysis in Section ITlI-B] — in particular, 
( fT9l ) and (|20l ) — guarantees that the line 

g-as = q{a), 



supporting line is a line that intersects a curve but does not separate any two points on the curve 1331 . 
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which passes through C, must be a supporting line of ^ua{s)- In other words, a is a subderivative of 
^ua (s) at point C, and is the derivative if it exists. This property, together with the construction of a in 
(l26l) . also ensures that C is always located between A and B. As illustrated in the figure, the curve is 
now bounded above by joining the three points {A, C and B), and it is bounded below by joining the 
three supporting lines (AE, EF and FB). 

The above procedure can then be repeated, in a recursive fashion, on the two curve segments AC 
and CB. Each stage of the recursion roughly doubles the number of points in the approximation, and 
we proceed until a fixed number of refinements have been computed. Figure |5(b)| shows the lower and 
upper bounds of ^uA^) obtained by starting from two initial points (0, ffPuafi) and {Xn, f'^Plaof n) 
and running the algorithm for two refinement iterations, involving a total of five eigenvalue evaluations 
(each corresponding to a single point drawn on the curve.) We can see that the proposed algorithm starts 
producing reasonable approximations of (s) after just a small number of steps. 

Let r]^\-) and rj^"\-) denote, respectively, the upper and lower bounds the algorithm generates after 
n eigenvalue evaluations. We measure the quality of approximation by computing the Hausdorff distance 
ll26l between these two bounds, defined as 



Informally, the Hausdorff distance d{n) is small if the two bounding curves are close to each other. The 
following theorem, which follows directly from ll26l Theorem 3], shows that d{n) is of order 

Theorem 2: Let e > be any preset precision level. To get d{n) < e, it is sufficient to run the 



approximation algorithm until we have n > max where W = y + £q{uq). 



Remark: In many practical applications, the underlying graph G is large but sparse. Correspondingly, 
M{-) are sparse matrices. Obtaining an approximation of 7^0 (s) within a given precision e then boils 
down to computing {e.g., via iterative power methods) the smallest eigenvalue and an associated eigen- 
vector of about 0(\l^/e) sparse matrices. 

Instead of approximating the whole curve, we may wish to find 7^^ (s) only for some particular value 
of s, as well as the signal that achieves it. The sandwich algorithm can be modified slightly to this end. 
At each step of the approximation procedure, we can choose to refine only the segment containing s, 
ignoring all other segments. Iterating in this way, we will find both 7„j, (s) and the vector with spectral 
spread s that achieves the bound. 



Sl S2 L 




AGASKAR AND LU: A SPECTRAL GRAPH UNCERTAINTY PRINCIPLE 



19 



IV. The Uncertainty Curve for Special Graph Families 

The uncertainty curves for several standard graph famiUes are analyzed in this section. The structure and 
regularity of complete graphs and star graphs make it possible to find closed-form expressions for their 
corresponding curves. For Erdos-Renyi random graphs lETI . |[28l . we will derive and compute analytical 
approximations for the expected {i.e., mean) curves under different parameters. Throughout this section, 
the distance metric d[-, ■) is assumed to be the geodesic distance. 



A. Complete Graphs 

A complete graph is a fully-connected graph in which every pair of distinct vertices is connected by an 
edge |[T2l . It is often used to model fully-connected subgraphs, or cliques, in real-world networks ll34l . 
The Laplacian matrix of a complete graph with vertices is given by 



Lij 



1, if i= j; 

(27) 



^ otherwise, 



i.e., the diagonal of L is all 1, and the off-diagonal elements are all equal to —jf^- It is easy to verify 
that L has eigenvalue with multiplicity 1, and eigenvalue with multiplicity — 1. Without loss 
of generality, we can choose the first vertex as the center The diagonal distance matrix is then 

Pu, = diag{0, 1,1,..., 1}. (28) 

We would like to compute the uncertainty curve 7(5) for a complete graph for s G [0, j^]- First, we 
will show that any vector that achieves the uncertainty curve has a special form. 

Proposition 3: For a complete graph, suppose x achieves the uncertainty curve. Then x is of the form 

X = [xi,X2,X2,. . . ,X2f. (29) 

Proof: See Appendix ICl ■ 
The result in Proposition |3] suggests that, for complete graphs, we need only consider vectors of the 
form in (|29] l. Enforcing the unit-norm constraint on (|29] , we can further simplify these eigenvectors as 
x{0) = [cos 6, Jn^ 1 ' ' ' ' ' Jn^ 1 ^'^ some parameter 9. The graph spread in this case is given 

by 

-2/1 1 1 

, = > 1 = cos 29, 
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where the second equaUty is due to a standard trigonometric identity. Meanwhile, by using the variational 
form in (|6), we can compute the spectral spread as 



9 , , , , / COS 9 sin C ^ 

.1 -lAT 1 . 



Af = (iV 



N 1 N -2 

sin 20 H cos 26* . (30) 



2N -2 y/N - 1 2iV - 2 

Combining these two expressions and using the identity sin^ 29 + cos^ 20 = 1, we can see that the 

uncertainty curve 7„„ (s) is part of the ellipse given by 

(2A^,.„ - 1)^ + {N-l) ^A? + ^A^,„„ - 1 j =1. (31) 

For fixed s = A^, solving for 7^0 (s) = ^^g,uo picking the smaller of the two solutions to (OTI ll leads 
to 

7no (s) = 



N-s{N-2)- 2^1 - (AT - 2)(3 - 1) - (AT - 

4 + (iV-2)2/(A^- 1) ' ^ ^ 

for < s < Tv/rj- Thus, the curve is the entire lower half of the ellipse given by (|3T| ). When the graph 
is large {i.e., N » 1), this curve converges to a straight line (s) = 1 — s in the s-g plane. 

B. Star Graphs 

A star graph 11211 with vertices has one central vertex and — 1 leaves, each connected by a single 
edge to the center. It is a prototypical example of a hub in a network ||34l . The Laplacian matrix can be 
expressed in block form as 

L=[ -7^1^-1 (33) 

where Iat^i is the {N — l)-vector of all ones, and I^-i is the {N — 1) x {N — 1) identity matrix. Since 
the graph is bipartite, the largest eigenvalue of L is always equal to 2 |[30l . Let uq be the center of the 
star; the diagonal distance matrix is again given by Pu„ = diag{0, 1,1,..., 1}. 

Just as for the complete graph, we can always represent signals that achieve the uncertainty curve on 
star graphs as x{9) = [cos 9, , , . . . , ■^==\^ for some 9 (see the remark in Appendix ICl for 
justification). Now, the graph spread is given by A^^^ = sin^ ^ — \ ~ \ cos 20; again, by using the 
spectral spread can be computed as 

cos 9 sin 9 



Ai = {N-l) 
= 1 - sin 20. 
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The lower bound curve is thus the lower part of the ellipse defined by 

(A^-1)^ + (2A^,,^- 1)2 = 1. 

Written explicitly, the curve is 

7uo(5) = ^(l- VK2-S)), for0<s<2. (34) 
We note that, unlike the complete graph case, this curve does not depend on the size of the graph. 

C. Erdds-Renyi Random Graphs 

An Erdos-Renyi random graph G is generated by taking vertices and selecting each pair of vertices 
to be an edge with probability p, independent of all other potential edges. We denote by Qp{N,p) the 
statistical ensemble of the resulting graphs. First studied by Erdos and Renyi |[27l . ll28l . Qp{N,p) may 
be the simplest random graph model. Although they do not capture all of the behaviors of real networks, 
Erdos-Renyi graphs are an excellent theoretical model because they lend themselves to tractable analysis. 

To study the properties of the uncertainty curves for Erdos-Renyi graphs, we generated several realiza- 
tions drawn from Qp{N,p) and used the approximation algorithm described in Section ITlI-CI to compute 
their uncertainty curves. It quickly emerged that the curves for different realizations generated with the 
same parameters were, for reasonable sizes of A^, tightly clustered around a common mean curve. This 
is illustrated in Figure |6l which shows the mean curves and estimated standard deviations for several 
parameter values. In what follows, we develop an analytic approximation for computing the expected 
{i.e. mean) uncertainty curve for different choices of parameters A^ and p. 

Recall from the definition of the uncertainty curve that we are trying to approximate the expectation 

of 

77yn(s) = min x P., x subject to \\x\\ = 1 and x Lx = s (35) 

over random graphs drawn from Qp{N,p). The matrices P^^ and L and the optimal vector x that 
solves the minimization problem are all random quantities. Since (s) is obtained through a nonconvex 
quadratic program, there is generally no closed-form expressions linking "fuo{s) to and L. As a 
result, directly computing the expectation of (s) will be difficult. To make the problem tractable, we 
proceed by replacing x'^Pl^x and x^Lx in (l35l ) with their respective expected values and minimizing 
after the fact. Later we will see that this strategy turns out to be very effective in generating accurate 
approximations. 



22 




(a) N = 1000,p = 0.03 



(b) Ar = 1000,p = 0.05 



(c) iV = 2000,p = 0.03 



(d) AT = 2000, p = 0.05 



Fig. 6. Uncertainty curves for Erdos-Renyi graphs. For each choice of {N,p) parameters, 1000 Erdos-Renyi graphs were 
generated and their uncertainty curves for s G [0, 1] were computed using the sandwich approximation procedure described in 
Section Unl The geodesic distance function is used. Each curve was interpolated to generate comparable curves on a regular grid. 
For each parameter choice, the mean and standard deviation of the interpolated curve was computed over the ensemble. The mean 
curve is plotted on the graphs as a solid line, with shaded areas illustrating the three standard deviation levels. Meanwhile, the 
approximate expected value computed before generating the curves is plotted as a dashed red line. The shape of the uncertainty 
curve is clearly quite stable across each ensemble, especially as A*' and p increase, and the approximate expectation curve is 
quite accurate. 



Another observation that emerged from our numerical experiment was a characteristic of the vectors 
that achieved the bound with s < 1: these vectors were all approximately radial functions, i.e., the value 
at any vertex v was a function of d{uo,v). Because this simplifies the analysis greatly, we will only 
consider the part of the curve with s < 1, which corresponds to signals that are maximally localized 
in both the graph and spectral domains. We will explicitly incorporate this assumption by focusing on 
vectors whose values depend only on distance from uq. In this case, the original A^-dimensional vector 
X G £'^{G) can be represented by a smaller vector y, with x{v) = y{d{uQ,v)). The dimensionality of y 
is equal to £g{uq) + 1, where £g{uq) is the eccentricity of the center vertex. We note that £g{uq) is a 
random variable that in principle can take any value between and — 1. When N is large, however, 
we can find a small number (fmax ~ ^^(log A^) such that £g{uq) < dmax with high probability ll35l . So, 
in what follows, we will treat y as a vector in M'^"">''+i. 

For a given, deterministic y, we will compute the expectations (over the randomness of the graph 
model) of and To that end, we define as the probability that a vertex v chosen 
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uniformly at random from V\{uq} has a distance d{uo, v) = d. The special case /i = p is easy to verify. 



For the other cases, we will use the results of Blondel et al. 11361 . who developed a recursive formulaci to 
find (approximate) analytical expressions of the entire sequence {fd}- The expected number of vertices 
at a distance d > 1 is (A^ — l)fd- It follows that, for fixed y, 



E ||a;| 



E 



^ y{d{uQ,v)f 



.vdV 



\0) + Y,{N-l)f,y'{k) 



k=l 



and 



E 



^ d{uo,vfx{vf 



.v£V 



Y,k\N-l)hy\k), 



(36) 



(37) 



k=l 



where the approximations are due to the truncation of y at dimension dm 
The spectral spread is more complicated. We start with the expression 

\ 2 



x^Lx = E f - 



X[U] 



X[V] 



\l deg(n) Vdeg(w) / 

By assuming that the degree of every vertex is approximately equal to its expectation (N — 1)^3, we write 

1 



rp 

X Lx 



^{x{u) -x{v)f. 



(38) 



(iV-l)p,^ 

Recall that x{v) = y{d{uo,v)). Consequently, the only edges that contribute to (|38T l are those between 
vertices at different distances from uq. Since a vertex at distance k can only be connected to vertices at 
a distance of — 1 and + 1, we simply need to characterize fc+i, the expected number of edges 
from vertices at a distance k to vertices at a distance A; + 1, for A; = to dmax — 1- The expected value 
of the spectral spread can then be obtained as 



E [x'^Lx] 



1 



{N - l)p 



J] Mk,k+i{yik + 1) - yik)y 



(39) 



A;=0 



It is easy to see that Mq^i = {N — l)p, since that is simply the expected number of edges incident upon 
uq. The other terms of M^^k+i can be approximated through a recurrence relation. First, we observe that 
the expected number of vertices at distance k is {N — l)fk and the expected number of vertices not at 
distance k (not counting uq) is {N — 1)(1 — fk). Thus, we can approximate the total number of potential 
edges between these two disjoint sets of vertices is {N — 1)^/^(1 — fk)- Since each potential edge will 



Unlike our construction, they allowed v to be any vertex in V , including uq; thus, in their result, fo — j^, and all other 
values of fd differ from ours by a factor of ^^-^ . For large A'^ the difference is negligible. 
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be chosen with probability p, we get that M^-i^k + ^fc,fc+i ~ — — fk), which leads to the 

following approximate recurrence relation 

Mo,i = {N- l)p 

(40) 

Mk,k+i -{N- Ifpfkil - fk) - Mfc„i.fe, for k > 1. 

v 

The expressions in (l36l ). (|37] ). and (|39l ) show that the expected values of the squared norm, graph 
spread, and spectral spread are all nonnegative quadratic forms involving the vector y G ]g'^max+i jj- 
foUows that we can write 

E [\\xf] « y^Hay, E [x'^Pu.x] « y^Ply, and E [x^Pu^x] « y^Lay, 

for some positive semidefinite matrices Ha, Pa, La, respectively. Substituting these expectations for their 
(random) counterparts in (l35T l. we compute our approximation of the expected uncertainty curve, (s), 
as 

%o{s) = min y^Ply subject to y^ Hay = I and y'^ Lay = s. (41) 

We note that this minimization problem (a quadratic program with quadratic constraints) has exactly the 
same mathematical structure as the one previously studied in (fTTl) . Using the same techniques derived in 
Section IIII-B[ we can show that any solution to (|4T]) satisfies the (generalized) eigenvalue problem 

{Pi - aLa)y = T^in{a)Hay (42) 

for some value of a, where T^in{a) is the smallest (generalized) eigenvalue. As before, we can construct 
a sandwich approximation to the curve by solving (|42] | for a sequence of a's. 

Despite the various approximations made along the way, the analytical solution obtained in (|4TI ) fits 
experiment remarkably well. As illustrated in Figure |6] the resulting analytic curves (shown in dashed 
lines) match almost perfectly with the observed sample average (shown in solid lines). We note that the 
matrices in (|4TI ) are of size dmax x c^max> which is much smaller than N x N. For example, for the 
Qp{lO^, 10~^) model, we would have c?max = 4 (the smallest d such that 1 — Ylk=i fk < 10^^.) 

Thus, the analytic approximation derived here can be computed far faster than the actual uncertainty 
curve for any realization of the model, and does not itself require any realization to be generated. 

V. Diffusion Processes and Uncertainty Bounds 

In constructing dictionaries to represent signals on graphs, one would like the dictionary elements 
to be localized in both graph and spectral domains. Quantifying the signal localization in these two 
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Fig. 7. (a) Network of football games between NCAA Division I- A teams in the 2000 regular season II37I : (b) Spectral spread 
versus graph spread on this graph. Solid line: computed uncertainty curve "/uq{s). Triangles: scaling functions in diffusion 
wavelets 0. Squares: scaling functions in spectral graph wavelet transform (SGWT) ||6]. (The true SGWT scaling functions 
are not related to the wavelet functions by a two-scale relation; here, we simply take the cumulative sum of the coarsest-level 
scaling function and higher-level wavelet functions.) 



domains and studying their fundamental tradeoff have been one of the motivations of this work. To test 
the theoretical results and the computational algorithm presented in Section |llll we consider two graph 
wavelet transforms in the hterature: the diffusion wavelets of Coifman and Maggioni H and the spectral 
graph wavelet transform of Hammond et al. [6l. The localization properties of these two constructions 



are studied on a graph visualized in Figure |7(a)| based on the network of football games played in the 
2000 regular season by NCAA Division I- A teams ||37l . While the spectral graph wavelet transform does 
not downsample the graph, the diffusion wavelet transform does. In our experiment, the center vertex 
Uo is chosen to be one of the vertices that remain in the downsampled graph at the coarsest level of the 
diffusion wavelet transform. 

Figure |7(b)| shows several scaling functions from both constructions plotted against the uncertainty 
curve 7«o(s), with the latter obtained by using the sandwich algorithm in Section IlII-CI In this and 
all subsequent experiments, we use eight refinement iterations (for a total of 257 sparse eigenvalue 
evaluations) to plot the uncertainty curves. At this level, we find the lower and upper approximations 
of 7uo(*) to be visually indistinguishable. As predicted, both the spectral graph wavelet and diffusion 
wavelet constructions result in basis elements that obey the computed bound. In fact, they follow the 
curve quite well. 
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The diffusion wavelets are based on the evolution of a discrete time diffusion process on a graph. In 
the classical setting, where the signal domain is the real line, there is a strong connection between the 
continuous time diffusion process and the Heisenberg uncertainty curve: to see this, consider a diffusion 
{i.e. heat) equation 

— = — , (43) 

dt dy"^ ' 

where u{y, t) is a function of y, t G M. This equation governs the conduction of heat in physical processes, 
and its solution was the original motivation for Fourier analysis. The fundamental solution to ( |43] ). i.e., 
the solution with the initial condition that n(y, 0) = 5{y — yo) for a given yo, is the Gaussian kernel 

K{t,y,yo) = ^=e . 
V47rt 



Thus, if we start with an impulse and evolve according to (|43l i. at time t we get a function with time spread 
t and frequency spread ^, achieving the classical Heisenberg uncertainty > ^ with equality. In 

other words, the diffusion kernels on the real line are exactly the signals that achieve the time-frequency 
uncertainty bound. 

This line of thought motivated us to consider a continuous-time diffusion process on graphs, governed 
by an equation analogous to ( |43] ): 

dx 

— = -Lx, (44) 
dt 

where L is the graph Laplacian. With the initial condition a;(0) = dug, the solution to (l44l ) is ||5l 

N 

x{t) = e-'^^Suo = e"*^" fiflK , (45) 

i=l 

where e~*^ is the matrix exponential of L, {Xi} are the eigenvalues of L, and {/j} are the corresponding 
eigenvectors. Denote by r]uo{s) the curve in the s-g plane traced out by the diffusion process. The curve 
can be given in parametric form as 

We show in Appendix |E] that s{t) is a strictly decreasing function of t; therefore it is one-to-one. 
Furthermore, s(0) = 1 and lim(_^oo'S(t) = 0. All together, this guarantees that the function rjuois) is 
well-defined for every s G (0, 1]. 

We plot in Figure [8] the diffusion curve rjua i^) and the uncertainty curve {s) for three different 
graphs: a random geometric graph [|381 that can capture the connectivity of wireless sensor networks; an 
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(a) Random geometric graph 



(b) Mesh graph 



(c) Small-world graph 





Fig. 8. Diffusion process versus the uncertainty curve for three types of graph, (a) A random geometric graph ll38i . (b) a 
triangular mesh j39j, and (c) a small-world graph 11401 . Below each graph, (d), (e), and (f) show the associated uncertainty 
curves (solid black line). A continuous-time diffusion process is run on each graph, beginning with an impulse at one vertex, 
and the resulting spreads are plotted (solid red line with circles). The circles are evenly spaced in time. The diffusion process 
tracks the curve closely, though close examination reveals that the match is not exact. 



unstructured triangular mesh|j for finite element analysis ||39l ; and a small- world graph POl that serves as 
the mathematical model for social and various other empirical networks. The geodesic distance function 
is used. In all three cases, the spreads of the diffusion process, though not exactly achieving the bounds 
as in the classical setting, match the uncertainty curves remarkably well. 

The following proposition, proved in Appendix |D] asserts that for certain special graphs the match 



'This graph was generated using the Mesh2D MATLAB toolbox written by Darren Engwirda, available online at MATLAB 
Central ( http://www. mathworks.com/matlabcentral/fileexchange/25555 1 ). 
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between r]uo{s) and 7«o(s) is exact. 
Proposition 4: For all s G (0, 1], r]^ 



7uo(s) if (a) G is a complete graph with N vertices and 



uq is any vertex; or (b) G is a star graph with N vertices and uq is the vertex with degree — 1. 

For general graphs we can show that, under certain conditions, the low-order derivatives of the 
uncertainty curve and the diffusion curve match. 

Proposition 5: Let G be any connected graph and uq be any vertex on G. Then r/„„(l) = 7Mo(f) = 0' 
= 0, and 



ds 



ds 



deg no 



s=l 



2 V i 

Z-^v^Uo d{v ,Uo)^ dcg V 

d{v,Uo)- 
deg Uq Z-/v^Uo deg V 



< 



ds^ 



s=l 



T ^— 

, Z-^v^uq deg V 

with equality if and only if d{v, uq) is identical for every v ~ 



2 ' 



Uq. 



(47) 



This proposition is proved in Appendix |E] It is easy to verify that the geodesic distance satisfies 
the condition required for equality in (147) . Extrapolating the observations in Figure [8] and results in 
Propositions |4] and |5] leads us to believe that diffusion kernels on arbitrary graphs will always be close to 
optimal in graph and spectral localizations. We leave further rigorous study of this tantalizing conjecture 
as an important line of future work. 



VI. Conclusions 

Analogous to the classical Heisenberg uncertainty principle in time-frequency analysis, an uncertainty 
principle for signals defined on graphs was developed in this work. After presenting quantitative definitions 
of the signal "spreads" in the graph and spectral domains, we provided a complete characterization of the 
feasibility region achieved by these two quantities. The lower boundary of the region, which is analogous 
to the classical uncertainty bound ([T}, was shown to be achieved by eigenvectors associated with the 
smallest eigenvalues of a particular matrix-valued function. Furthermore, the convexity of the uncertainty 
curve allows it to be efficiently approximated by solving a sequence of eigenvalue problems. We derived 
closed-form formulas of the uncertainty curves for complete graphs and star graphs, and developed a 
fast analytical approximation for the expected uncertainty curve for Erdos-Renyi random graphs. The 
localization properties of two existing wavelet transforms were evaluated. Finally, numerical experiments 
and analytical results led us to an intriguing connection between diffusion processes on graphs and the 
uncertainty bounds. 



AGASKAR AND LU: A SPECTRAL GRAPH UNCERTAINTY PRINCIPLE 



29 



Appendix 

A. The convexity ofT>uo 

We would like to prove that the set Du„ is convex as long as the number of vertices > 3. (The need 
for such a condition will be made clear shortly.) This is equivalent to showing the following result. 
Proposition 6: Suppose that there exist two vectors Xi, X2 in with > 3, such that 

xjxi = 1, xjLxi = Si, and xfP^^^Xi = gi, for z = 1,2. (48) 

Then for any ^ G [0, 1], we can always find a vector x in satisfying 

x^x = 1, x^Lx = s, and x'^P'^^x = g, (49) 

where s = /3si + (1 - f3)s2 and g = Pgi + (1 - P)g2- 

We will prove the above proposition by recasting the problem in Sym^, the Hilbert space of real, 
symmetric N x N matrices. The space is endowed with the Hilbert-Schmidt inner product defined by 
(A, B)hs == tv{A^B) = tr(AB), where tr(-) is the trace of a matrix. Every x £ can be mapped 
onto a matrix X = xx^ in Sym^. Finding a vector x satisfying the conditions in ( |49l ) then boils down 
to finding a rank-one positive semidefinite matrix X = xx^ satisfying the following three constraints 

tr(X) = 1, tr(ZX) = s and tr(P^^X) = g. (50) 

The requirement that X be a rank-one matrix makes this a hard problem, because the cone of rank-one 

matrices is not convex. Instead, we will use the following theorem to relax the problem to the cone of 

positive semidefinite matrices SS^, which is convex. 

Theorem 3 (Barvinok l{4Tj): Suppose that R > and N > R + 2. Let T-L C Sym^y be an affine 

/ R + 2\ 

subspace such that codim(?^) < I I . If the intersection SS'^ Dl-L is nonempty and bounded, then 



2 

there is a matrix X in SS^ n T-L such that rank(X) < R. 

Proof of Proposition |6l- First, we note that the three equalities in (ISOl l are all affine constraints on 
X. Together, they define a hyperplane H C Symjy with codim('H) < 3 = ( ^ )■ (In fact, /, L, and 



P^^ are linearly independent, so codim('H) = 3.) To apply Theorem |3l we verify next that SS^ nH is 
nonempty and bounded. 

First we show that it is bounded: let X be an arbitrary matrix in the intersection SS^ n V. (assuming 
one exists), and let {ui, U2, . ■ . , vn} be its eigenvalues. The equalities 1 = tr(X) = X]^=i ^n, together 
with the nonnegativity of the eigenvalues, imply that 

N N 



\X\gs=tT{X^) = ^l^l<^iyn = l. 



n=l n=l 
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Therefore, SS^ n ^ is a subset of the unit ball in SynijY and is thus bounded. 

To show that SS^ n H is nonempty, we explicitly construct a member of the set. Let xi, X2 be the 
two vectors satisfying (1481 ). On mapping the vectors to two matrices Xi = xixf and X2 = X2X2 , the 
constraints in (l48l ) can be rewritten as 

tv{Xi) = 1, tv{LXi) = Si and tv{Pl^Xi) = gi, for i = l,2. 

Xi and X2 are both in SS^. Now set X' = pXi + (1 - fi)X2. It is easy to see that X' G Ti and, 
because SSl is convex, X' G SS^ as well. To be sure, the matrix X' G SS^ n H is not necessarily 
of rank one. However, the result of Theorem |3] (for the case when R = 1) guarantees the existence of 
a rank one matrix X in SS^ n Ti. Decomposing this matrix as X = xx'^ and using the equivalence 
between (|49l l and (l50l l. we can conclude that the resulting vector x satisfies all the constraints in (|49] i. 

■ 

Remark: The above proof uses Theorem |3] for the case when R = 1. Consequently, we need to work 
with N > R + 2 = 3. This requirement is sharp in that the achievable region for a graph with two 
vertices (i.e., N = 2) is not convex. The only connected graph with = 2 is the complete graph. All 
unit-norm signals on this graph can be parametrized as (cos 0, sin 0). By computing the corresponding 
graph Laplacian and distance matrices, it is easy to show that the achievable region is only the boundary 
of an ellipse (not including its interior) and hence is not convex. 

B. Proof of Lemma\J\ 

(a) For any ai < 02, let vi and V2 be two unit-norm eigenvectors in SS{ai) and SS{a2), respectively. 
Applying Rayleigh's inequality, we get V2 M{ai)v2 > q{ai) = vf M{ai)vi Similarly, we have 
—v'2 M{a2)v2 > —vj M{a2)vi. A combination of these two inequalities leads to 

v'^ (M(qi) - M{a2)) V2 > vj (M(ai) - M{a2)) vi. (51) 

Recall that M{a) = PI_^^ - aL, and therefore M{ai) - M{a2) = {02 - ai)L. Replacing this identity 
into dST] ). we thus have 

rji rp 

V2 Lv2 > Vi Lvi. 

Note that vi and V2 can be arbitrary unit-norm elements in 55(01) and SS{a2), respectively. If, in 
particular, we choose vi,V2 to be those that attain the maximization in (|2TI ). we get /i+(a2) = V2LV2 > 
vJLvi = /i+(ai). Similarly, we can show that h^{a2) > /i_(ai). 
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(b) We will only consider the limits when a tends to — oo as given in (|23] ). The other case, when a 
tends to +00, can be analyzed in a similar way, and its proof will be omitted. Let q > be any positive 
number. By definition, 

/i+(a) > /i_(q) > 0, (52) 

where the second inequality is due to the Laplacian matrix L being positive semidefinite. Next, we show 
that /i+(a) can be made arbitrarily close to as a — )• —00. To that end, let v be any unit-norm eigenvector 
in SS{a), and fi be the first eigenvector of L as defined in Since SS{a) is associated with the 
smallest eigenvalue q{a), we have, from Rayleigh's inequality, 

v^{Pl^ - aL)v < fJiPl - aL)f, = f^Plj^, 

with the equality coming from the identity Lf^ = 0. For any a < 0, reamnging the above expression 
leads to 

v^Lv < -i {flPlj, - v^Plv) < (53) 

a a 

where the second inequality uses the bound of the graph spread as provided in Proposition [T] Since 
( l53l ) holds for any nonzero element v from SS{a), we must have /i+(a) < — <S^(no)/Q;, which, when 
combined with (l52l) . completes the proof. 

(c) First, using eigenvalue perturbation results, we will derive a candidate set A of points such that 
q{a) is certainly analytic on [a, b]\A. We will show that A is finite, so that the set of nonanalytic points 
of q{a) is finite as well. Then, we will compute and /i+(a) explicitly, and show that they are are 
left- and right-continuous, respectively, and that they are equal to the negative left- and right-derivatives 
of q{a), respectively. We will then show that = /i+(a) everywhere except a subset B A; 
therefore, they satisfy (l24l i. Since A is finite, it follows that B is finite as well. 

The starting point of our analysis is the following result. 

Proposition 7: There exist analytic functions Ai(-), . . . , AAr(-) and analytic vector-valued func- 
tions a;i(-), . . . , xn{-) such that 

M{a)xi{a) = Xi{a)xi{a), (54) 

and Xi{a)'^Xj{a) = 6ij. 

Proof: Standard perturbation results llBTl p. 404] guarantee the existence of such functions for any 
matrix function that is analytic and whose value is always Hermitian. The function M(-) as defined in 
([TS] ) is affine in a, and thus analytic; it is symmetric and real for every a, and thus Hermitian. Therefore 
functions with the properties listed in the proposition do exist. ■ 
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From Proposition |7J we can write q{a) as 



q{a) 



min A,' (a) 

l<i<N ^ ^ 



(55) 



where the {Aj(-)}j are the eigenvalue functions guaranteed by the proposition. For any G M, if 
<S'(ao) has dimension one, then precisely one of the eigenvalue functions is equal to q{-) at oq, say 
-^fc("o) = 9(0^0)- Pick some e < ^ minjyfc |Aj(ao) — Afc(ao)|. Since every Xj{-) is analytic, we can 
find some neighborhood M of for which \Xj{a) — Aj(ao)| < £ for every j. This guarantees that 
Afc(a) < Xj{a) on M for every j 7^ /c. Thus (7(a) = Xk{a) on A/". Since Afc(-) is analytic on M, we have 
that q{-) is analytic on M and therefore at oq- We can make this more general. Suppose instead of only 
one eigenvalue function attaining the minimum at ao, there are multiple eigenvalue functions [e.g., two, 
denoted by Afcj(-) and X^^i-)] that attain the minimum, and that they are all equal on a neighborhood 
M of oq. All the other eigenvalue functions are larger at ao. Again, the analyticity allows us to find a 
neighborhood M' C A/" on which all the other eigenvalue functions are larger than Ayfcj(-) = Xk^i')- Now, 
since q{a) = Xk^ia) = X^^ia), the function q{a) is analytic on M' as well. 

Thus, a necessary condition for q{-) to be nonanalytic at ao is that two (or more) distinct eigenvalue 
functions must intersect at ao- Define = 1, . . . , N' , N' < N as the set of distinct eigenvalue 

functions, and let rij be the multiplicity of the eigenvalue function fJ.j{-). Now consider an arbitrary finite 
interval [a, b] and define 



It is a well known property of analytic functions that if they are equal on more than a finite set of points 
in an interval, then they are identical. Since the /ii(-) are distinct analytic functions, no two of them 
can be equal on more than a finite set of points in [a,b]. Thus A is the finite union of finite sets, and 
therefore contains only a finite number of points 

Next, we connect q{a) to /i+(a) and /i_(a). At any point oq £ [o-jb], there can be A: > 1 distinct 
eigenvalue functions that achieve the minimum in (l55l) . Without loss of generality, we shall assume 
they are the first k functions, fii{-), . . . , fik{-)- The associated eigenvectors, Xij{ao), for i = l,...,k 
and j = 1,. . . ,ni, form an orthonormal basis for the eigenspace SS{ao). Any unit-norm element v S 
^^(ao) can then be written as v = Yli=i Y^]=i ^ij *ii(oo)> for some constant coefficients {cij} satisfying 





l<i<j<N' 




AGASKAR AND LU: A SPECTRAL GRAPH UNCERTAINTY PRINCIPLE 



33 



equality yields 

k Ui k rii 

M'{a)v{a) +M{a)v'{a) = ^ ^ Cij;u'j(a) a^jj (a) + ^ ^ Cjj7ij(Q)a;-j(a). (56) 

i=l j=l i=l j=l 

Evaluating (l56l ) at a = ao, pre-multiplying it by v'^{ao) and using the substitutions M'{a) = —L, 
M{ao)v{ao) = q{ao)v{ao), and f-ii{ao) = q{ao) for every i, we get 

k rii k rii 

-v^{ao)Lv{ao) + q{ao)v'^ {ao)v' (ao) = X] XI ^ii/^i("o) + q{ao) ^ ^ Cijv^ {ao)x[j{ao). (57) 

1=1 j=l i=l j=l 

The second terms on the left-hand and right-hand sides of (|57] ) are equal, leaving us with 

k rii 

v'^{ao)Lv{ao) = - X] X] 4^i(«o)- (58) 
i=i j=i 

By definition, /i+(ao) and /i_(ao) are the two extreme values of v^{ao)Lv{ao). Maximizing (and 
minimizing) the quantity in (1581 ) subject to the unit-norm constraint X]i=i Yl]=i cfj = 1, we have 

/i+(ao) = ™ax (— /i^(ao)) and /i_(qo) = min (— //^(qq)). (59) 

l<j<fc l<i<fc 

Now, there must exist some m, ^ G {1, . . . , /c} such that 

fi£{a) if a < ao 



(60) 

fJ.m{o() if a > ao 



on some neighborhood of ao, which can be chosen to be small enough that A^R^ = {ao} if ao £ A 
or TV n ^ = otherwise. We must have /i'^(ao) = mini<j<fc ^'^(ao), since if /ij(ao) < /x^(ao) for 
some j, then on a sufficiently small neighborhood of ao, we would have q{a) = fij{a) < ^im{ci) for 
a > ao, contradicting (l60]lj^ Meanwhile, away from ao there are no other points in J\f at which multiple 
distinct eigenvalue functions intersect. Thus, from (|59] l. we have that h^{a) = — /x^(a) on AAn [ao, oo). 
Since the /ii(-) are all analytic, /i+(a) is right-continuous at ao- Furthermore, since q{a) = firn{ct) on 
J\f n [ao,oo), /i+(ao) is equal to the negative right-derivative of q{a) at ao- By similar arguments, we 
can show that /i_(a) is left-continuous at ao and is equal to the negative left-derivative of q{a) at ao- 
A necessary condition for /i_(ao) ^ /i+(ao) is that k > 1, i.e., there are multiple distinct eigenvalue 
functions achieving the minimum in (|55]) . Thus the set of points B at which they differ satisfies ;B C ^, so 
B is finite. Meanwhile, if /i+(ao) = /i_(ao), then the equality must hold for all a € A/" as well because 

*The requirement that fi'^aiao) = niini<i<fc /i^(ao) might not always be sufficient to uniquely determine m, however. In the 
case that multiple distinct eigenvalue functions achieve the minimum derivative, j-im{-) is then determined by comparing the 
higher order derivatives. This nuance does not affect our proof, which only depends on the first derivative. 
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of the way we constructed the neighborhood J\f. Since /i_(a) is left-continuous and /i+(a) is right- 
continuous at ao, both functions are continuous at ao- EquaUty also means the left- and right-derivatives 
of q{a) are equal at oq, and thus (?'(ao) is well-defined with /i_|_(ao) = /i_(ao) = — ^'("o)- 



C. Proof of Proposition \3\ 

For N = 2 the proposition is trivial, so let us assume > 2. By Theorem [T] x must be an eigenvector 
associated with the smallest eigenvalue of M{a) = P^^ — aL for some a, where L and Puo are given 
by ( [27] ) and ( [28] ). respectively. M{a) is given in block form as 



-a 

Mia) 

a -1 

where B is the (A'^ — 1) x — 1) circulant matrix B 
{wi, . . . ,wi\]-2] be an orthonormal set of vectors in 
subspace of vectors in 



N-l^N-l 



B 

1 



N 



a]I 



. Let 



^ ^ such that Wi _L Iat-i. This set spans the 
orthogonal to Iat^i. It is easy to verify that Bwi = (1 — 



a Wi. 



Furthermore, if we set Vi = (0, wj) , then we can see that Vi are all eigenvectors of M{a) with 



eigenvalue 1 



N 



a. 



If we can show that this is not the smallest eigenvalue of M{a), i.e. that q{a) 7^ 1 — j^ct, then 
it follows that x [an eigenvector of M{a) corresponding to q{a)] must be orthogonal to every Vi for 
i = 1, . . . , N — 2. This will then guarantee that x is of the form ([29] ). 
To show that q{a) / I ^ 



N- 



—a, we let y = [yi, . . . ,yN] be chosen such that ||y|| = 1, yi ^ 0, and 



y^lN = 0. This last property makes y an eigenvector of L with eigenvalue jf^. We have y^ PtoV 



N 



.T r,2 



l-y'i and y' Ly = j^. Thus y' M{a)y = l-yt 



N 
N-1 



a <l 



N 
N-1 



a. It follows from the Rayleigh 



inequality that q{a) < y M{a)y < 1 



N 



jj^ct, proving the proposition. 
Remark: With small modifications, the above proof can be used to demonstrate that the same property 



holds for star graphs, i.e. any vector achieving the uncertainty curve must be of the form in (I29b . For a 
star graph with N vertices, we have 



M{a) 



-a 



(1 - a)lN-i 



(61) 



a -I 

7W=i^^-^ 

Again, there is an (N — 2)-dimensional eigenspace spanned by the same set {I'ililTL^ as in the complete 
graph case above. In this case, the eigenvalue associated with that subspace is 1 — a. Thus, to show that 
the smallest eigenvector is of the desired form, we must simply show that there is some unit norm vector 
y for which y'^M{a)y < 1 — a, guaranteeing that the eigenvector associated with the smallest eigenvalue 
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is orthogonal to the eigenspace spanned by {vi}^J[^. Our test vector here is y = (1,0, . . . ,0)^, which 
gives us M{a)y = —a < 1 — a, so the same property holds for the star graph as the complete graph. 



D. Proof of Proposition |4| 

(a) Let 1/]^, . . . , /jv} be an orthonormal basis of with /i = It is easy to verify that 
these are eigenvectors of L [given in (|27] )1 with corresponding eigenvalues Ai = and = for 
A; = 2, ...,7V. 

It follows from (1451 ) that the diffusion process starting from xq = 5uo can be obtained as 

x{t) = fjJSu, + e-^^\l - (62) 

Assuming without loss of generality that uq = 1 and using the fact that fi = -^1, we have x{t) = 
[l + (iV - l)e^^"*, 1 - e~^"*, . . . , 1 - e^'^-*]^. Using our knowledge of L and the fact that Pu^ = 
diag(0, !,...,!), it is now straightforward to compute the spreads as Al{x{t)) = ^ and 

^^{x{t)) = ^ ^ — ]y~ir2A~t ■ verify that these spreads satisfy (|32] i. Thus, for all t > 0, x{t) 

achieves the uncertainty curve. A^(cc(t)) is continuous and limt_j.oo A^(a;(t)) = 0, so r]uo{s) = lua{s) 
for s e (0,1]. 

(b) Here, we assume without loss of generahty that the center of the star, i.e., the vertex with degree A^— 

1 is tio = 1. Again, we explicitly construct an orthonormal eigenbasis for L, given in this case by ( [33] ). In 

what follows, we will assume that N > 2; the star graph with 2 vertices is the same as the complete graph 

r "1 ^ 

with 2 vertices, so the proof from (a) will apply to that case. Let fi 



N 



V2' V2(A^-1)'"'' y/2(N~l) 



T 

, and = [0, gl] ' iox k = 2, . . . , N - I, where {9^)^=1 is 



any orthonormal basis for M^~^ satisfying = It is easy to verify that {fk}k=i forrns an 

orthonormal basis for M^, and that the are eigenvectors of L with corresponding eigenvalues Ai = 0, 
A2 = • • • = Atv-i = 1, and Atv = 2. 

Similar to (|62] |. we can compute the diffusion process explicitly as 

x{t) = fifJSu, + e-\l - /i/f - fNfDSuo + e-^'fNflK. (63) 
= (1 - e-*) fjl6u, + - e-*) f^fld^, + e-*d„„. (64) 



Using the expressions for fi and /jy, we find that x{t) 



I _L lp-2t 1 (^ _ p-2t\ -.T 



l-e 



-2t\2 



From this, we can compute the graph spread as {^{t)) = — and the spectral spread 

^' ° 2(1 + e 
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as A'^{x(t)) = — It is easy to verify that these spreads satisfy (l34l i. and so x{t) achieves 
the uncertainty curve for t > 0. Once again, A^(a;(t)) is continuous and lim^^^oo A^(a;(t)) = 0, so 
Vuois) = luais) for s G (0,1]- 



Proof of Proposition \5\ 

We know from Theorem [T] that every point on the uncertainty curve is achieved by an eigenvector 
associated with the smallest eigenvalue q{a) of M{a) = — aL. In particular, the point (1,0) is 
achieved by Sug, which is the eigenvector associated with the matrix A^(0) = P^^^ and eigenvalue 
q{0) = 0. Since d{uQ,v) = if and only if uq = v and d{v,uo) > otherwise, the eigenspace 5(0) is 
one-dimensional. Thus, from the proof of Lemma [T] in Appendix |Bl there is some neighborhood M of 
a = on which S{a) is one-dimensional, and therefore q{a) is analytic. In this case, there exists some 
neighborhood of s = 1 for which we can use the parametric form of the uncertainty curve given in (|25T l. 
namely (s,7«o(s)) = {su{a), gu{a)) where s„(a) = -q'{a) and gu{a) = q{a) - aq'{a) for a^M. 

We can thus compute the derivative of the uncertainty curve parametrically as 

dluQ ^ ^ -aq"{a) ^ ^ ^^^^ 

ds s'^{a) -q"{a) 

where a is chosen so that s{a) is the argument at which we wish to evaluate the derivative. Similarly, 
the second derivative is 



,9 J- ( 9UA\ 



(66) 



ds'^ s'^{a) —q"{a) 

Both (l65l ) and (l66l ) require that q"{a) be nonzero. In what follows, we will explicitly compute g"(0) 
and show that q" {a) / for a € A/"', where J\f' C J\f. As described in the proof of Lemma [U there is 
an analytic eigenvector function v{a) defined in a neighborhood of a = such that 

M{a)v{a) = q{a)v{a), (67) 

with v{{)) = Suo and ||?;(q)|P = 1. The spectral spread function is given by Su(a) = v{a)Lv{a) = 
—q'{a), where the second equality is due to (1251 ). So we can compute 

q"{a) = -2v{a)Lv'{a). (68) 

To compute v'{a), we differentiate both sides of (|67l ) and after rearranging terms obtain 

[M{a) - q{a)I]v'{a) = Lv{a) + q'{a)v{a). (69) 
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From (|6T| | and the fact that S{a) is one-dimensional on J\f, M{a) — q{a)I has a one-dimensional 
nuUspace spanned by v{a). Since = = 2v{a)^v'{a), when we multiply both sides of (l69l) 

by the Moore-Penrose pseudoinverse of M{a) — q{a)I, we obtain 

v'{a) = [M{a) - q{a)I]+Lv{a), (70) 

where we have also used the fact that [M — q{a)I]'^v{a) = to simplify the right-hand side of (ITOl l. 

Setting a = and using the fact that q{0) = and v{0) = Su^, we have v'{0) = (P^ J L^^,, . 
Substituting this into m, we get q"{0) = -25l^L {pI^ LS^^ = "2 ^1 ^f^V "^^^'^ (^O- 



is the t;th entiy of L6uq- From the definition of the graph Laplacian, we have that for every v ~ uq, 

{L8uo)v = rr^ ^5=- Thus, 



derivatives (|65] ) and ( [66l ) are valid at s = 1, which corresponds to a = 0. We obtain 
the expression for '^^^2" 



Since the graph is connected, q"{0) / 0, and since q{a) is analytic on M, there exists a neighborhood 
M' C J\f containing on which q"{a) 7^ as well. Thus our expressions for the first and second 

= and 

s=l 

given in (l47l ). 

s=l 

To compute the derivatives of the curve rjugis) traced out by the diffusion process x{t), we express 
it parametrically in terms of t, with (s,r/„^(s)) = (sdit), gdit)) where Sdit) = and gait) = 

We first show that Sd{t) < 0. To simplify the computation of this and other derivatives, we introduce 
the function Rz{t) = "^^|*^(^|^2^*^ for any fixed matrix Z. It is easy to verify that since x(t) = —Lx{t), 
Rz{t) = '2Rz{t)RL{t) — RLz{t) — Rzhit), where the last two terms in the sum are equal if Z is 
symmetric. Since we have an explicit solution x{t) = e^^^Sug, we can see that ||a;(i)|| 7^ for all t, so 
that Rz{t) and its derivative is well-defined. 

Since Sd{t) = Rhit), we have Sd{t) = 2{sd{tf - RL^{t)) = 2 [{x{tfLx{t)f - x{tfL^x{t)\ < 
by the Cauchy-Schwarz inequality. Equality would hold only if Lx{t) were a multiple of x{t) — i.e., if 
x{t) were an eigenvector. From (l45T l we can see that this could only occur if 6ua itself were an eigenvector, 
which is impossible for a connected graph. We can directly evaluate 5^(0) = 1 and lim^^oo Sd{t) = 0; 
combining this with the above result guarantees that Sd{t) is a one-to-one function with range (0,1]. 
Thus i]ua (•5) is well-defined on that domain. 

Since gd{t) = Rp^ (t), we can compute the derivative gd{0) = 5d(0)srf(0) — Rlp^ (0) = 0. Thus the 
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diffusion curve's derivative at s = 1 is given by 



dr], 



Uo 



ds 



9d{t) 







d^, 



s=l 



^ Sd{t) " ds 
Meanwhile, we can simplify the second derivative evaluated at s = 1, obtaining 

d^^u,, 9diO) 



ds' 



s=l 



The first derivative of Sd{t) at t = can be computed as 

= 2{l-\\LSuJ\')=-2Y, 

vr^Ua 



1 



deg uq deg v 



The second derivative of gait) is 

9d{t) = '^9d{t)sd{t) + 2gd{t)sd{t) - 4sd{t)RLP- {t) + 2RL-p2 {t) + 2RLP- ^(t) 

UQ UQ UQ 

At i = 0, the only nonzero term in (|75) is the last one: 

ifd(O) = 2Hi,p. ,(0) = 2<S^ iPj, L<S„. 



V^Uo 



deg Uo deg v 



(72) 



(73) 



(74) 



(75) 



(76) 



s=l 



Now we can combine ( 1731) . ( 1741 ). and (1761 ) to obtain the expression for 

Cauchy-Schwartz inequality, (Zv^uo dk^) < {Zv^u^ ^fe 
and only if d{v, uq)^ = c for every v ~ uq, where c is some constant. Comparing the expressions for the 



given in ( |471 ). By the 
j7 wa I with equality if 



second derivatives of the uncertainty curve and diffusion curve, we can see that -g^ 
with equality if and only if d{v,UQ) is identical for every v ~ uq. 



> 



s=l 



ds^ 
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